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ABSTRACT 

The Library of Apicomplexan Metabolic Pathways 
(LAMP, http://www.llamp.net) is a web database 
that provides near complete mapping from genes 
to the central metabolic functions for some of the 
prominent intracellular parasites of the phylum 
Apicomplexa. This phylum includes the causative 
agents of malaria, toxoplasmosis and theileriosis— 
diseases with a huge economic and social impact. 
A number of apicomplexan genomes have been 
sequenced, but the accurate annotation of gene 
function remains challenging. We have adopted an 
approach called metabolic reconstruction, in which 
genes are systematically assigned to functions 
within pathways/networks for Toxoplasma gondii, 
Neospora caninum, Cryptosporidium and Theileria 
species, and Babesia bovis. Several functions 
missing from pathways have been identified, where 
the corresponding gene for an essential process 
appears to be absent from the current genome an- 
notation. For each species, LAMP contains inter- 
active diagrams of each pathway, hyperlinked to 
external resources and annotated with detailed infor- 
mation, including the sources of evidence used. We 
have also developed a section to highlight the overall 
metabolic capabilities of each species, such as the 
ability to synthesize or the dependence on the host 
for a particular metabolite. We expect this new 
database will become a valuable resource for funda- 
mental and applied research on the Apicomplexa. 



INTRODUCTION 

The Apicomplexa is a large phylum of obligate intracellu- 
lar eukaryotic protozoa, part of the Chromalveolata. They 
are defined by possessing an apical complex at their 
anterior end. The Apicomplexa originate from a photosyn- 
thetic predecessor and some members of the phylum retain 
a non-photosynthetic plastid called the apicoplast. The 
different branches of Apicomplexa occupy different ecolo- 
gical niches and possess varying metabolic capabilities 
owing to their early divergence from a common 
ancestor. It is estimated that Coccidia {Toxoplasma, 
Neospora, Eimeria, Sarcocystis, Cryptosporidium) 
separated from Aconoidasida (Plasmodium, Theileria and 
Babesia) ~705 million years ago (Mya) (1,2), although 
Douzery et al. calculated it as 495 Mya (2,3). These vari- 
ations are because of the sensitivity of different phylogen- 
etic classifications. The former estimation considers 
Cryptosporidia to be a branch of Coccidia, whereas 
Douzery et al. considered Cryptosporidia to be an inde- 
pendent group that separated from other apicomplexans 
around 580 Mya (2,3). The divergence time between the 
closely related apicomplexans Toxoplasma gondii and 
Neospora caninum is estimated to be about 28 Mya (4). 
These parasites present a considerable health and 
economic burden on society. Malaria, a disease of the 
tropical world, caused around 216 million cases of fever 
and 655 000 deaths in 2010 (5) and it is one of the major 
causes of poverty, especially in sub-Saharan Africa where 
it costs US $12 billion each year (6). T. gondii, which can 
infect any warm blooded animal, is found in around a 
third of human populations and can be fatal in 
immuno-compromised individuals. Toxoplasmosis can 
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be transmitted in utero through the placenta, causing con- 
genital toxoplasmosis in humans and other animals (7). 
Neosporosis and East Coast Fever are important bovine 
infections caused, respectively, by N. caninum (8) and 
Theileria parva (9) and lead to substantial losses in the 
dairy and beef industry. 

There are around 15 apicomplexan genomes sequenced 
so far from different branches of the phylum including 
several Plasmodium (10-15) species, T. gondii (16), 
N. caninum (4), Eimeria tenella (16), Babesia bovis (17), 
Cryptosporidium (18-21) and Theileria (22,23) species. 
These genomes and their annotations are available in 
EuPathDB (24) and GeneDB (25). The annotation of 
genes to biological functions remains challenging. The 
sole use of homology-based tools, such as Basic Local 
Alignment Search Tool (BLAST), can be insufficient and 
often produces incorrect assignments, especially where 
gene families exist. It is therefore important to perform 
careful and detailed functional annotation of these 
genomes to support both fundamental research and the 
development of therapeutics and vaccines. 

The association of genes to the specific role they play 
within metabolic networks or structural complexes helps 
to increase the precision of functional assignments, in a 
process known as 'metabolic reconstruction' first 
demonstrated in bacterial genomes such as Escherichia 
coli (26-28). Pinney et al. (29) demonstrated the need for 
a similar approach in the functional annotation of eukary- 
otic pathogens. Metabolic reconstruction approaches 
can identify incorrect annotations made by sequence 
homology-based methods and discover missing compo- 
nents of metabolic pathways. 

Metabolic reconstruction can be carried out either auto- 
matically or more slowly by manual curation. Automated 
resources available for the Apicomplexa include the 
Kyoto Encyclopedia of Genes and Genomes (KEGG) 
(30), ApiCyc (31) and MetaTIGER (32). The Malaria 
Parasite Metabolic Pathways (MPMP) resource exists 
for the intra-erythrocyte stage of Plasmodium falciparum 
(33). It is the only manually curated web database avail- 
able for any apicomplexan and is considered to be a gold 
standard resource (34). The use of MPMP and other auto- 
matically generated resources for the genome-scale meta- 
bolic reconstruction and metabolic flux-balance analysis 
of P. falciparum by Plata et al. (35) and Huthmacher 
et al. (36) demonstrate the importance of well-curated re- 
sources for systems-level analysis and computational 
evaluation of drug targets. MPMP pathways are recon- 
structed on the basis of having at least three to four 
enzymes acting consecutively in a pathway and having 
been annotated in the genome (33,37). A comparison of 
P. falciparum metabolic pathways from automatic recon- 
struction resources with MPMP showed that the auto- 
mated resources are highly error prone (37). The pitfalls 
of PlasmoCyc (part of ApiCyc) include that it has a larger 
number of polypeptides than predicted in the official 
genome release at PlasmoDB, as it was built from an 
earlier gene model release, more than half of these 
pathways do not meet the definition of a metabolic 
pathway and there are large numbers of pathway holes 
(37). In addition, some of the pathways that were 



experimentally proven to be absent, such as the 
mevalonate pathway of isoprene biosynthesis, have been 
included (37,38). In KEGG, annotated paralogs of 
enzyme coding genes are missing and many of the 
pathways are incomplete with fewer genes annotated to 
functions. The percentage of pathways overlapping in 
MPMP with KEGG is also <25% (37). 

The problems associated with automatic reconstruction 
resources reported for P. falciparum are relevant for 
other Apicomplexa including T. gondii. The goal of the 
Library of Apicomplexan Metabolic Pathways (LAMP) 
web database is to provide a curated resource for all 
sequenced Apicomplexa of medical and veterinary import- 
ance, except for Plasmodium, which is covered by MPMP. 
LAMP contains metabolic pathways constructed in a semi- 
automatic process, in which information from genome 
databases and experimental evidence available in the 
scientific literature has been integrated. In addition, 
sequence homology searches, protein functional motif iden- 
tification, information on organelle localization, experi- 
mental proteome identification and biochemical/ 
physiological evidence have all been used for metabolic 
pathway creation (Supplementary File 1 and Figure SI). 
The curated metabolic pathway annotations and all 
associated data can be freely accessed without registration 
by any individual through the http://www.llamp.net portal. 

WEBSITE ORGANIZATION AND CONTENT 

This website is divided into four main sections 
(Figure 1C). The introductory section provides a general 
overview of metabolic reconstruction, the methods used in 
the analysis and the guidance for understanding metabolic 
pathways. It also presents a comparative overview of 
metabolic capabilities present in different apicomplexan 
species. The second section is Toxoplasma and Neospora, 
which presents the pathways for T. gondii and 7Y. caninum 
grouped under the metabolic pathway super families of 
carbohydrate and energy metabolism, amino acids 
metabolism, lipids and glycan metabolism, nucleotide me- 
tabolism, vitamins, cofactors and other substrates metab- 
olism and other organellar pathways. The third and fourth 
sections are for the metabolic pathways of Cryptosporidia 
(Cryptosporidium muris, Cryptosporidium parvum and 
Cryptosporidium hominis) and Piroplasma (B. bovis, 
T. parva and Theileria annulata) species and are organized 
in the same way as Toxoplasma and Neospora. These 
organism-specific sections reflect the organization of 
genomes in different databases, ToxoDB (16), CryptoDB 
(21) and PiroplasmaDB (24), in EuPathDB. 

The pages of each metabolic pathway have four main 
sections in general. They comprise a literature overview, 
table of gene annotations, metabolic pathway diagram 
and a table containing source and fate pathways of metab- 
olites (Figure 1 D-G). The literature overview presents 
a textual description of the pathway we have written, 
describing the importance of the metabolic capability and 
any biochemical/physiological evidence available for the 
organism or its close relatives. It also includes links to 
the PubMed pages of relevant publications, thus acting 
as source of detailed information for users wishing to 
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Lipoic acid metabolism 

Lipoic acid is an essential cofactor of dehydrogenase enzymes, mainly the E 2 subunit of pyruvate dehydrogenase, 
branched-chain keto-acid dehydrogenase and 2-oxoglutarate dehydrogenase complexes. Lipoic acid is an organic 
sulphur compound derived from the 8-carbon fatty acid, octanoic acid. In most eukaryotes, lipoic acid is 
synthesised in mitochondrion and it is consistent with the localisation of above mentioned three complexes in 
mitochondria. In apicomplexan parasites, pyruvate dehydrogenase complex is localised in the plastid organelle, 
apicoplast. As the de novo fatty acid biosynthesis via FAS II pathway f fatty acid biosynthesis in the apicoplast] 
occurs in apicoplast, it is possible to suggest that Toxoplasma gondii can de novo synthesise lipoic acid in the 
apicoplast. Although other dehydrogenases are located in mitochondria, de novo fatty acid biosynthesis does not 
take place in apicomplexan mitochondrion. The work by Crawford et al in T. gondii demonstrated that lipoylation 
of mitochondrial proteins is reduced when the parasites are grown in lipoic acid-deficient medium and no effect was 
seen with lipoylation of apicoplast proteins. This effect in mitochondrion has also shown to be reversed with 
exogenous lipoic acid supply [i]. This clearly shows that apicoplast synthesises lipoic acid de novo, whereas 
mitochondrion salvages it from host. The plasma membrane and mitochondrial transporter which are involved in 
lipoic acid salvage and mitochondrial localisation are not yet identified in T. gondii genome. The lipoic acid 
metabolism pathway for Plasmodium falciparum in MPMP is identical to pathway here and Plasmodium also 
follows the biosynthesis and salvage mechanisms as in T. gondii. 
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Figure 1. Screen shot of T. gondii lipoic acid metabolism pathway page in LAMP. (A) The general search box. (B) This block shows the specific 
search option, which allows the search to be carried by choosing one or more of the parameters such as organism, EC number, gene ID and pathway 
name. (C) The blocks showing the four main sections of the website. (D) Introductory text of the metabolic pathway page. (E) The enzyme 
annotation table with enzyme names, EC numbers, annotated gene IDs, any available localization evidence and the source of localization 
evidence. The EC numbers are linked out to ExPASy and the gene IDs are linked out to respective pages in ToxoDB. (F) The metabolic 
pathway diagram is present in a scrollable window with a link to open the diagram in a new window. The apicoplast-based de novo biosynthesis 
branch is visible in the screen shot. (G) The table showing the substrates and products of the pathway with their origin and fate pathways. The 
pathway names have hyperlinks to the respective pathway pages. (H) The block for signing up or signing in to the website, required for adding 
comments and curation and not for accessing pathways or data downloads. 119 x 219 mm (300 x 300 DPI). 
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gain a summary of past research. The annotation table 
comprises the list of enzymes/proteins in the pathway 
with their full or partial Enzyme Commission (EC) 
numbers available and the gene ids. If any enzyme 
required for the completion of the pathway is missing, it 
is noted in the table with 'missing in annotation' in the 
column for gene ids. The full EC numbers and gene ids 
are linked to their respective pages in the enzyme 
database (39) of ExPASy (40) and EuPathDB, respectively. 
In addition, the organelle localization and the source of 
the localization (with links to the source) are provided 
for T. gondii, if available. The organelle localization for 
other species will be added in future releases as and when 
more experimental data becomes available. One of the 
main limitations of existing resources such as KEGG and 
metaTIGER is that they do not provide enzyme names 
and other metabolites such as cofactors in the metabolic 
maps. In LAMP, these are taken into account and the 
cofactors are represented with correct stoichiometry in 
our pathways. The maps also include enzyme names with 
EC numbers. Hyperlinks to KEGG compounds are 
provided for each metabolite to allow users to access 
the chemical formulas and structures. The enzymes in the 
pathway are hyperlinked to the KEGG enzyme database, 
where the full EC number is assigned. When the full EC 
number is not assigned for a particular enzyme, the links 
are provided to the KEGG ontology database. The 
presence of hyperlinks for each reaction to the respective 
page of KEGG reactions will help distinguish the alterna- 
tive reactions catalysed by the same enzyme. The pathways' 
diagrams can be viewed either in the annotation page 
with the use of scrollable bar or in a new window. 
Upstream pathway(s) (sources of the pathway precursors) 
and downstream pathways (ways of utilization of the path- 
way's products) are shown in an accompanying table. The 
pathways thus provide a mechanism by which users can 
learn which metabolites can be produced de novo by 
the parasite and which require uptake from the host. 



Such information can be helpful in the search for new 
therapeutic targets. 

LAMP can be searched through two different 
mechanisms. The general search box (Figure 1A) allows 
the retrieval of data via text mining and therefore the 
output of the search will provide links to all pages where 
the searched term is present. The advanced search option 
(Figure IB) allows the retrieval of gene annotations by 
choosing one or more of the parameters from drop 
boxes (Supplementary Figure S2), querying the underlying 
relational database of annotations only and not the 
textual descriptions of pathways. 

The core metabolic pathways for the eight species men- 
tioned are available in LAMP. Where possible, it also 
includes a comparison with MPMP for P. falciparum. 
By design, annotations of proteins to other biological pro- 
cesses such as DNA replication, DNA damage repair, 
RNA degradation and translation are not present, along 
with enzymes that do not fit into metabolic pathways such 
as proteases and peptidases for which more traditional 
forms of functional annotation may be appropriate. 
The genes annotated to metabolic pathways in LAMP 
constitutes around 7-10% of total protein coding genes 
of the organisms (Table 1). The number of missing 
enzymes in the annotation for T. gondii is low (20) and 
approximately the same as that for P. falciparum in 
MPMP (19), suggesting that the accuracy of metabolic 
reconstruction and the underlying gene models for these 
species are both high. 

We have performed a survey to summarize the missing 
enzymes, which is available on the website (http://www. 
llamp.net/?q = Missing%20enzymes). Some enzymes are 
missing in all apicomplexans, including glucosamine- 
phosphate /V-acetyltransferase, acyl-Acyl Carrier Protein 
(ACP) thiolesterase, phosphatidylglycerophosphatase 
and 3-octaprenyl-4-hydroxybenzoate carboxy-lyase. 
The enzymes dolichol kinase and dolichylphosphatase 
are only missing in Coccidians — T. gondii and 



Table 1. A survey of the data available for the different apicomplexan genomes in LAMP 



Organism 


Total number of 


No. of 


No. of unique 


No. of unique 


No. of total 


No of missing 




protein coding genes 11 


pathways 


EC numbers' 3 


enzymes 0 


genes in pathways 


enzymes' 5 


P. falciparum' 


5418 


42 


294 


316 


520 


19 


T. gondii ME49 


7934 


51 


386 


417 


666 


20 


N. caninum 


7080 


51 


381 


41 1 


659 


26 


C. muris 


3934 


31 


208 


224 


302 


15 


C. parvum 


3805 


28 


191 


207 


270 


10 


C. hominis 


3886 


28 


184 


200 


261 


17 


B. bovis 


3706 


32 


203 


216 


322 


11 


T. parva 


4082 


32 


199 


213 


323 


17 


T. annulata 


3795 


32 


200 


214 


313 


16 



"Total numbers of protein coding genes for Apicomplexa are obtained from the respective databases of EuPathDB (24). 

b Unique EC numbers represents the total number of unique enzyme activities with full EC numbers assigned by IUBMB present in the metabolic 
pathways annotated. 

"Unique Enzymes represents total unique enzyme activities annotated to be present in the pathways for an organism. This includes enzyme functions 
with full and partial EC numbers and without EC number annotations. 

d Missing enzymes represents the enzymes need to be present to complete the metabolic pathways. They may either be missing in the gene model 
predictions or may be absent in the organism. 

The metabolic pathways for P. falciparum are not present in LAMP. The numbers for P. falciparum do not reflect the total number of genes 
annotated to all pathways in MPMP. but only those genes annotated to core metabolic functions. 
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N. caninum. Dolichyl-phosphate-mannose-glycolipid 
alpha-mannosyltransferase is missing in all Piroplasma 
species and adenosine deaminase is missing in Theileria 
genomes. These missing enzymes could be owing to a 
novel enzyme or branch yet to be discovered in these 
pathways. There is also a set of enzymes that is 
randomly missing in one or two species, suggesting 
errors in the current gene models. These include inosine 
monophosphate (IMP) dehydrogenase, asparagine-linked 
glycosylation (ALG5) and ALG11 in C. muris, 
phosphopantothenoylcysteine decarboxylase in T. parva 
and 6-pyruvoyltetrahydropterin synthase in N. caninum. 
The complete set of metabolic pathway annotations and 
the metabolic pathway maps of these organisms are avail- 
able in Microsoft Excel and Systems Biology Markup 
Language (SBML) files for download at the website 
(http://www.llamp.net/7q = Downloads). In addition, 
detailed comparison of metabolic capabilities between 
these apicomplexans and comparison with human metab- 
olism is also available in the 'downloads' section, which 
we anticipate should be helpful in identifying putative 
novel drug targets. 

IMPROVEMENTS IN METABOLIC 
RECONSTRUCTION WITH MANUAL CURATION 

The combined use of bioinformatics resources and the 
wealth of information available in the literature has 
improved the annotation of individual enzyme functions 
or led to the validation of overall pathways in LAMP. 
The majority of pathways present in LAMP are at least 
partly supported by experimental evidence in either 
T. gondii or P. falciparum, and a process of manual 
curation has been performed on every pathway. Curation 
tasks typically involve reading the scientific literature, 
checking for pathway holes, identification of mistaken 
automated assignments and writing textual summaries 
for each pathway to give users confidence that the 
pathway is a true reflection of the known biology of the 
organism. In the following section, we provide some 
examples of pathways in which manual curations have led 
to improvements. 

An example of validation of a metabolic pathway with 
biochemical evidence is lipoic acid metabolism in 
T. gondii. The dissection of the T. gondii gene models sug- 
gested the presence of enzymes involved in both de novo 
lipoic acid synthesis (lipoyl-ACP:protein AMipoyl transfer- 
ase and lipoic acid synthase) and salvage (lipoate-protein 
ligase). The work by Crawford et al. (41) in T. gondii 
demonstrated that lipoylation of mitochondrial proteins 
is reduced when the parasites are grown in lipoic acid- 
deficient medium and no effect was seen with lipoylation 
of apicoplast proteins. This effect in mitochondria has also 
shown to be reversed with exogenous lipoic acid supply. 
This suggests the presence of apicoplast-based synthesis 
and mitochondrial salvage pathways. 

Aromatic amino acid hydroxylase and phosphati- 
dylserine synthase are examples of enzymes for which 
annotation of substrate specificity and enzyme function 
were improved using biochemical evidence. The two 
genes encoding an aromatic amino acid hydroxylase 



enzyme were annotated as either phenylalanine 
4-hydroxylase (EC number: 1.14.16.1) or tyrosine 
3-monooxygenase (EC number: 1.14.16.2) in different re- 
sources. However, it has been demonstrated experimen- 
tally that the two enzymes can accept both 
phenylalanine and tyrosine as substrates, although with 
higher preference for tyrosine (42). In LAMP, we have 
therefore annotated the two genes with both functions. 

A further example of manual curation performed in 
LAMP involves phosphatidylserine, which can be 
synthesized either from cytosine diphosphate (CDP)- 
diacylglycerol or from phosphatidylethanolamine. In 
MPMP, the P. falciparum phosphatidylserine synthase 
has been annotated as CDP-diacylglycerol dependent (EC 
number: 2.7.8.8). Conversely, published enzyme assays in 
T. gondii have allowed us to improve the EC number anno- 
tation in LAMP, as a study by Gupta et al. (43) showed the 
presence of base-exchange-dependent activity from 
phosphatidylethanolamine (EC number: 2.7.8.29) rather 
than CDP-diacylglycerol-dependent activity. 

The pathways that have been added without any bio- 
chemical evidence and with missing enzymes for T. gondii 
are lysine biosynthesis and threonine biosynthesis 
pathways; these were added on the basis of the presence 
of predicted genes for the majority of enzymes and having 
proteomics evidence for at least one of the enzymes. 
It remains an open question as to whether T. gondii can 
synthesize lysine and threonine, as we have searched both 
the gene models and the raw genome sequence for 
evidence of the missing functional domains required 
to complete these pathways and, as yet, found no 
evidence. It remains possible that there are novel enzym- 
atic functions or new pathway branches still to be 
discovered. 

UNDERSTANDING HOST-PARASITE BIOLOGY 
FROM METABOLIC PATHWAYS 

The comparative analysis of metabolic capabilities 
between different Apicomplexa suggests that the variations 
are attributable to the different ecological niches they 
occupy and the divergent environmental stresses they 
undergo. The Coccidians T. gondii and N. caninum 
possess greater metabolic capabilities than other 
Apicomplexa species, which is essential for the generalist 
life style of these Coccidia infecting almost any cell type. 
P. falciparum does not have some of the capabilities 
present in Toxoplasma, the main ones being synthesis 
of some of the amino acids, as they can salvage these 
from haemoglobin digestion (Figure 2). The absence of 
genes for lysine and threonine biosynthesis pathways in 
monoxenic Coccidia E. tenella (pathways to be added 
to LAMP in a later release) may be suggestive of the 
stress T. gondii and N. caninum undergo in the tissue 
cyst (bradyzoite) stage. The presence of starch metabolism 
(polysaccharide storage) and trehalose synthesis pathways 
in Coccidia and Cryptosporidia is indicative of the chal- 
lenges oocysts face in the external environment. The 
favourable host environment of Cryptosporidia (epithelial 
cells of the small intestine in C. parvum and C. hominis and 
the gastric glands of the stomach in C. muris) and the 
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Figure 2. A schematic diagram of Apicomplexa illustrating the differences in the metabolic capabilities of T. gondii and P. falciparum. The metabolic 
capabilities present in T. gondii and absent in P. falciparum are shown in red, and the capabilities present in P. falciparum and absent in T. gondii are 
in blue. The pathways present in both are not shown. Although P. falciparum can synthesize thiamine de novo, it also salvages it from host. The 
pathways are shown in the organelles where they are predicted to occur in the Apicomplexa. If part of a pathway is predicted to occur in an organelle 
and another part in the cytosol, it is shown in midway between the organelle and cytosol. The complete comparison of metabolic capabilities of these 
species and other species in LAMP is available at http://www.llamp.net/7q = Apicomplexan%20comparison. 53 x 38 mm (300 x 300 DPI). 



absence of formation of the parasitophorous vacuole in 
Piroplasma have led to reduced metabolic capabilities in 
these species including the synthesis of many amino acids, 
vitamins and cofactors. A striking difference between 
C. parvum/C. hominis and C. minis is the absence of 
Tricarboxylic acid (TCA) cycle in the former species, 
likely due to the increased nutrient availability in the 
small intestine. An integrated Apicomplexa database 
such as LAMP will help understand these differences 
and provide a community tool for selecting both 
organism-specific and general drug targets. 

DISCUSSION 

By providing an integrated metabolic pathway database 
for apicomplexan parasites, we aim to provide the 
medical, veterinary and scientific community with a 
resource to study the biochemistry of these parasites and 
to investigate the host-parasite biology. This resource will 
also be useful for researchers working on flux-balance 
analysis and metabolomics of these apicomplexan para- 
sites in addition to providing interesting targets for 
enzyme characterization and drug development. We 
believe that the inclusion of biochemical evidence and 



links from genes to the regularly updated EuPathDB 
gene models will provide users considerable added value 
beyond those data provided in complementary resources. 
Links back from EuPathDB gene models to the pathways 
in LAMP are currently under development and will be 
available in an up-coming release. The development of 
LAMP does not only provide a metabolic pathway 
resource for Apicomplexa but also serves as an exemplar 
for other annotation projects to follow. For example, there 
are many more parasites of medical and veterinary signifi- 
cance such as Entamoeba, Kinetoplastids and parasitic 
helminths with genomes currently being sequenced that 
would be appropriate for following a similar methodology. 

The P. falciparum resource MPMP is of high quality, 
with much effort given to manual curation. It is an 
organism-specific resource, and the usefulness of it to 
wider Apicomplexa community is limited. LAMP 
provides a balance between a highly curated organism- 
specific resource and general automatic reconstruction 
resource such as KEGG. The metabolic reconstruction 
of T. gondii was performed through a hugely detailed 
manual curation effort and thus we believe pathways 
and mappings are accurate to a high level. The pathways 
for Cryptosporidia and Piroplasma were developed using 
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the P. falciparum and T. gondii pathways as a framework, 
with less manual eviration than T. gondii because there is 
less transcriptomics, proteomics and biochemical evidence 
available for these organisms. However, we believe these 
pathways are still accurately curated to a level beyond 
another similar resource based purely on automated 
reconstruction. 

We also encourage the active participation of the 
apicomplexan research community. Although access to 
the pathway or the download of files does not require 
any registration, researchers with specialist knowledge in 
Apicomplexa biochemistry will be able to contribute to the 
curation by freely registering to the website with an insti- 
tutional email address. Researchers can also make 
comments on annotation and metabolic pathways 
through registration. This database is still under active 
development and improvements will be applied in 
response to user feedback. In conclusion, we believe that 
LAMP acts as an important resource for researchers 
wishing to understand the metabolic capabilities of this 
important group of parasites. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary File and Supplementary Figures 1 and 2. 
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